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Abstract 

NKX3.1 is a honneobox transcription factor whose function as a prostate tunnor 
suppressor rennains insufficiently understood because neither the 
transcriptional progrann governed by NKX3.1 , nor its interacting proteins have 
been fully revealed. Using affinity purification and nnass spectronnetry, we have 
established an extensive NKX3.1 interactonne which contains the DNA repair 
proteins Ku70, Ku80, and PARR, thus providing a nnolecular underpinning to 
previous reports innplicating NKX3.1 in DNA repair. Transcriptonnic profiling of 
NKX3.1 -negative prostate epithelial cells acutely expressing NKX3.1 revealed 
a rapid and connplex response that is a near nnirror innage of the gene 
expression signature of hunnan prostatic intraepithelial neoplasia (PIN). 
Pathway and network analyses suggested that NKX3.1 actuates a cellular 
reprogrannnning toward lunninal cell differentiation characterized by suppression 
of pro-oncogenic c-MYC and interferon-STAT signaling and activation of tunnor 
suppressor pathways. Consistently, ectopic expression of NKX3.1 conferred a 
growth arrest depending on TNFa and JNK signaling. We propose that the 
tunnor suppressor function of NKX3.1 entails a transcriptional progrann that 
nnaintains the differentiation state of secretory lunninal cells and that disruption 
of NKX3.1 contributes to prostate tunnorigenesis by pernnitting lunninal cell 
de-differentiation potentially augnnented by defects in DNA repair. 
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Introduction 

NKX3.1 encodes a homeodomain transcription factor whose 
expression is largely restricted to the prostate and controlled by 
androgen. The gene is located on chromosome 8p21 in a region 
frequently deleted in early prostate cancers (reviewed in^ -). Studies 
in Nkx3.1 knockout mice have provided compelling evidence that 
Nkx3.1 is a prostate tumor suppressor^"\ These mice develop pro- 
static intraepithelial neoplasia (PIN), a precancerous lesion char- 
acterized by hyperproliferation of dysplastic cells, indicating that 
NkxS.l is haploinsufficient for PIN suppression^'. Additional studies 
showed that serial passage of PIN-like lesions from Nkx3.1 mutant 
mice can undergo progressively severe histopathological altera- 
tions^ Finally, loss of NkxS.l can cooperate with loss of Pten and 
p27 in prostate cancer development in mice^ ^ while Nkx3.1 over- 
expression inhibits cell proliferation in Pten null epithelial grafts'^. 
These data indicate that the diminished expression of NKX3.1 that 
is frequently observed in human prostate cancers"' is involved in the 
initial stage of prostate carcinogenesis. While the tumor suppressor 
function of NKX3.1 remains poorly defined at the molecular level, 
the knockout phenotypes suggested that Nkx3.1 controls genes 
involved in prostate development, differentiation, and maintenance 
of tissue integrity. 

Like other NKX class homeoproteins, NKX3.1 can function as a 
transcriptional repressor by binding a non-canonical homeodomain 
DNA motif such as naturally occurring in the mouse androgen 
receptor promoter"^ or artificially presented in synthetic reporter 
genes Transcriptional repression may involve NKX3.1 -mediated 
recruitment of co-repressors^- and the histone deacetylase, HDACP. 
A second mode of trans-repression found for the pro state- specific 
antigen (PSA) gene occurs independently of NKX3.1 promoter 
binding sites, but through repressive interaction with transcrip- 
tional activators such as SPl^^ and prostate-derived ETS factor 
(PDEF^^). NKX3.1 was also shown to activate gene transcription, 
either through direct promoter binding as in the case of PCANl and 
HK2^^'^^ or through interaction with other transcriptional activators 
such as serum response factor (SRF) or FoxAl and the androgen 
receptor (ARY^'^K 

Transcriptomic profiling combined with global mapping of > 9,500 
genomic binding sites by ChIP- sequencing revealed a set of 
282 putative direct target genes that were differentially expressed 
in young NKX3.1"^" prostates not displaying PIN^^''^'^. A subset 
of NKX3.1 target genes was also regulated by MYC with both 
transcription factors showing mutual antagonism^^\ Since overex- 
pression of Myc cooperates with loss of Nkx3.1 in mouse prostate 
tumorigenesis, maintaining proper control of the common Nkx3.1/ 
Myc target genes may be involved in Nkx3.rs tumor suppressor 
function^^. A similar study in aged mice already displaying PIN 
revealed a gene expression signature indicative of impaired response 
to oxidative stress-". Interestingly, these changes correlated with 
a 5 -fold increase in oxidative DNA damage in Nkx3.r^ prostates. 
Whether oxidative DNA damage is a direct consequence of loss of 
NKX3.1 or a secondary consequence of PIN development is unknown. 

Another key to understanding the tumor suppressor function of 
NKX3. 1 potentially lies with its protein interaction partners. Several 



have been described that modulate NKX3.rs transcriptional effects 
(e.g. SRF^^^i pDEF^^ HDACl^, SFV\ MYO\ and AR^^). In 
addition, NKX3.1 was shown to bind to and augment the activity 
of topoisomerase I, suggesting that it functions in DNA repair--'^\ 
NKX3.1 localizes to sites of DNA damage, promotes ATM and 
ATR activity, and enhances the survival of cells exposed to DNA 
damage-"^. Loss of NKX3.1 function in premalignant prostate cells 
may therefore accelerate the acquisition of DNA damage, poten- 
tially aggravated by unabated accumulation of reactive oxygen 
species thus promoting cellular transformation-^. Nevertheless, it 
is currently unclear whether the function of NKX3.1 in DNA 
repair is indirectly mediated through transcriptional effects or directly 
through physical interactions with the DNA repair machinery. 

In this report, we present an analysis of the NKX3.1 protein inter- 
actome that revealed intimate physical links of NKX3.1 with the 
DNA repair machinery, namely components of the DNA-dependent 
protein kinase (DNA-PK) holocomplex (XRCC5/Ku80, XRCC6/ 
Ku70) and poly(ADP) ribose polymerase (PARPl). In addition, 
transcriptomic profiling of immortalized prostate epithelial cells 
upon acute activation of NKX3.1 revealed a rapid and complex 
transcriptional response that is a near mirror image of the gene 
expression signature of human PIN devoid of NKX3.1. Taken 
together, these data shed new light onto the elusive tumor suppres- 
sor activity of NKX3.1, directly implicating this homeoprotein in 
DNA repair and in driving a gene expression signature indicative 
of an essential function in maintaining the differentiation state of 
luminal prostate epithelial cells. 

Materials and methods 

Tissue culture, plasmids, viruses, antibodies 

The human prostate cancer cell line LNCaP was obtained from 
ATCC and maintained in RPMI 1640 (Hyclone, Cat.# SH30027.01) 
supplemented with 10% fetal bovine serum (Sigma, Cat.# F6178- 
500ML), 50 units/ml penicillin, and 50 units/ml streptomycin 
(Thermo Scientific HyClone, Cat.# SV30010). The NKX3.1 
cDNA was amplified from LNCaP mRNA, sequence confirmed, 
and cloned into pFLAG thereby attaching three consecutive FLAG 
epitope tags to the N-terminus. For DNA transfection, LNCaP cells 
were grown to 50-70% confluence on a 150 mm dish and trans- 
fected with 30 |ig of plasmid DNA using DOTAP reagent according 
to the recommendations of the manufacturer (Roche, Indianapolis, 
IN). Immortalized human prostate epithelial cells (LH cells, kindly 
provided by Dr. W. Hahn;-'') were maintained in Prostate Epithelial 
Cell Basal Media (Lonza, Cat.# CC-3165) including growth fac- 
tors, cytokines, and supplements (PREGM Singlequots, Lonza, 
Cat.#CC-4177). 

For production of adenoviruses, the ADEASY system was used 
as previously described^^. The NKX3.1 cDNA was cloned into 
the pADTRACKl shuttle vector. The resulting plasmid was trans- 
formed into BJ-ADEASY cells by electroporation. Adenoviral 
DNA generated by recombination in BJ-ADEASY cells was iso- 
lated and transfected into 293 cells (ATCC) using standard calcium 
phosphate procedures. Virus was harvested from cells and ampli- 
fied by infection of 293 cells. Amplified virus was tittered and used 
at a multiplicity of infection of -100. 
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The following antibodies were used: Flag mouse monoclonal 
(Sigma-Aldrich Cat# F1804, RRID:AB_262044), NKX3.1 
mouse monoclonal for immunoblotting (Invitrogen Cat# 35-9700, 
RRID:AB_138690), Anti-human NKX3.1 goat polyclonal (Santa 
Cruz Biotechnology, Inc. Cat# sc- 15022, RRID:AB_650285) for 
immunoprecipitation, GFP mouse monoclonal (Clontech Cat# 632380, 
RRID:AB_10013427), actin mouse monoclonal (MP Biomedicals, 
Irvine, CA, Cat.# ICN691001), BANF rabbit polyclonal (EMD 
Millipore Cat# 09-893, RRID:AB_1977041), Ku70 mouse mono- 
clonal (GeneTex Cat# GTX23 1 14, RRID: AB_367 103), Ku80 mouse 
monoclonal (GeneTex Cat# GTX72225, RRID: AB_383445 ), MYC 
rabbit polyclonal (Epitomics Cat# 1472-1, RRID:AB_562270), 
p21 rabbit monoclonal (Cell Signaling Technology Cat# 2947S, 
RRID:AB_823586), HSPA8 rabbit polyclonal (Sigma-Aldrich 
Cat# SAB2101098, RRID:AB_10604580), PAR? mouse monoclo- 
nal (BD Biosciences Cat# 556494, RRID:AB_396433), H0XB13 
rabbit polyclonal (Invitrogen Cat# 422500, RRID :AB_ 15 00227). 

FLAG-NKX3.1 affinity purification 

Cells of one 150 mm dish transfected with pFLAF-NKX3.1 or 
empty vector were lysed in each 1 ml IP lysis buffer (50 mM Tris- 
HCl pH 7.4, 150 mM NaCl, 1% Triton X 100) on ice. Per affinity 
purification, 4 |ag FLAG M2 antibody (Sigma-Aldrich Cat# F1804, 
RRID:AB_262044) was coupled to 50 |il magnetic beads in 0.2 M 
triethanolamine, pH 8.2 and 20 mM dimethyl pimelimidate with 
rotational mixing at room temperature for 30 min. The reaction was 
stopped by resuspending beads in 1 ml 50 mM Tris, pH 7.5 for 
15 min. After five washes in IP lysis buffer, the beads were added 
to the cell lysate. Upon incubation for 4 h at 4°C, the lysate was 
removed and stored as "depleted lysates" at -20°C, whereas the 
beads were washed 5 times with 1 ml IP lysis buffer. After the final 
wash, beads were resuspended in 50 |il elution buffer (5 ^ig of triple 
FLAG peptide in PBS) and incubated at 4°C for 30 minutes with 
vortexing. The sample was analyzed by immunoblotting (10%), 
silver staining (2%), and LC-MS/MS (88%). 

Liquid chromatography and tandem mass spectrometry 
(LC-MS/MS) 

LC-MS/MS analysis of affinity purified FLAG-NKX3.1 complexes 
was performed as previously described in detaiP^ -l In brief, eluates 
were digested in solution with trypsin, and peptides were separated 
by reversed phase chromatography. Peptides were analyzed on an 
LTQ Orbitrap XL mass spectrometer (Thermo Fisher Scientific; 
San Jose, CA). The MS/MS method was top 4-data dependent. 
Dynamic exclusion was enabled. Data were searched against an 
international protein index (IPI) human protein database using Sor- 
cerer-SEQUEST (SageN Research; Milpitas, CA). 

Semi-quantitative analyses using spectral counting 

Spectral counts are the number of times an ionized peptide is selected 
by the mass spectrometer for MS/MS, in the data-dependent mode 
and provide widely accepted, semi-quantitative estimates of relative 
protein abundance^'^. QTools, which are in-house developed visual 
basic macros (available from: www.dieter-wolf-lab.org/protocols) 
for automated spectral count analysis, were used to compute spec- 
tral counts of the proteins, using the PeptideProphet output from 
the trans-proteomic pipeline (TPP; Institute for Systems Biology, 
Seattle, WA;^«). 



Post-identification protein filtering 

Purifications of FLAG-NKX3.1 were performed in quadruplicate 
(i.e. 4 biological replicates), each time starting with a fresh batch 
of cells. Altogether eight samples from affinity purifications (quad- 
ruplicates of mock and FLAG-NKX3.1) were analyzed repeatedly 
(3 times per sample, i.e. 3 technical replicates of each sample) by 
LC-MS/MS for a total of 24 LC-MS/MS runs. 

Altogether we identified 315 human proteins (Data set lA). To com- 
pile a high confidence NKX3.1 protein interactome, we first per- 
formed a background subtraction, i.e. the spectrum count obtained 
for each protein in the mock purifications was subtracted from the 
spectrum count obtained for that same protein in the corresponding 
FLAG-NKX3.1 purification (Data set IB). The subtracted spectrum 
counts were then summed over all 4 independent purifications. If 
negative values were obtained after summing (i.e. if a protein was 
consistently more abundant in the mock purification than in the 
FLAG-NKX3.1 purification), the protein was disregarded. This 
resulted in a list of 250 proteins with an average spectrum count of 
9.94 (Data set IB). From this lists of background- subtracted data, 
we removed all proteins with spectrum counts below the average 
(< 10) to exclude low-abundance proteins potentially non- specifically 
associated with NKX3.1. This resulted in a list of 71 background 
subtracted and abundance-filtered proteins. In the next step, we 
collapsed redundant protein database entries (often resulting from 
multiple protein isoforms that were not distinguished by the pep- 
tides identified by LC-MS/MS) into single entries by adding their 
spectrum counts both in the mock and NKX3.1 purifications. This 
resulted in a non-redundant list of 58 proteins, which we refer to as 
the high confidence interactome (Data set IB). 

Since spectrum counts depend on protein size (larger proteins giv- 
ing rise to more tryptic peptides), we normalized spectrum counts 
to protein molecular weights, which we have previously found to be 
an appropriate method of normalization The summed, normalized 
spectrum count numbers of all non redundant proteins were used to 
assemble the final background subtracted list of 58 NKX3.1 inter- 
acting proteins (referred to as Sum NKX3.1 - Mock). The summed 
normalized spectrum count numbers were also used to determine 
the fold enrichment of a protein in the NKX3.1 sample over mock 
(Sum NKX3.1/Mock). Both lists were sorted according to abun- 
dance and compared in Figure ID to illustrate that both methods of 
background filtering (subtraction or division) yield an overlapping 
list of high confidence NKX3.1 interactors. The spectrum count 
intensity map in Figure IC reiterates most of the steps described 
above thus presenting a comprehensive view of the analysis. 

Reactome analysis 

The NKX3.1 interactome was analyzed with the Cytoscape Reac- 
tome FI plugin'^. The list of NKX3.1 interacting protein was loaded 
into Cytoscape and used to build Reactome networks allowing linker 
genes. The networks were clustered into modules, and pathways 
enriched in the modules (FDR < 0.01) were identified (Figure 2A). 

Transcriptome analysis 

Duplicate RNA samples collected from NKX3.1 adenovirus trans- 
duced LH cells or from LH cells transduced with the GFP control 
virus were used for microarray analysis on the Illumina platform. 
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Figure 1.The NKX3.1 protein interactome. (A) Representative purification of FLAG-NKX3.1 from transfected LNCaP cells. Cell lysates were 
absorbed to anti-FLAG M2 resin, and specifically retained proteins were eluted with FLAG peptide and separated by SDS-PAGE. A band 
migrating with the expected molecular weight of FLAG-NKX3.1 and absent from the mock purification (empty vector) is highlighted. (B) Four- 
way Venn diagram to indicate the degree of overlap in the protein content detected in four independent purifications of FLAG-NKX3.1. (C) 
Map of spectrum count intensities in the four independent FLAG-NKX3.1 and mock purifications. The map also contains the sum of spectrum 
counts across all purifications as well as summed data after adjustment for protein molecular weights. The right most two columns present 
two distinct ways of background correction, either by subtracting mock values from NKX3.1 values (NKX3.1 - Mock) or by calculating the 
factor of enrichment in the NKX3.1 sample over mock (NKX3.1/Mock). See the Materials and methods section for details on data analysis and 
processing. (D) Spectrum count intensity maps of the 25 most abundant components of the NKX3.1 interactome. Data were sorted either by 
factor of enrichment (left panel, NKX3.1/Mock sorted) or by background subtracted values (right panel, NKX3.1 - Mock sorted). Black type 
font indicates the proteins occurring on both lists independent of the method of abundance-based sorting. 



The Human 6-V2 Expression BeadChips (Illumina) were used, 
which contain -46,000 transcript probes. Primary data was collected 
using the manufacturer's BeadArray Reader using the supplied 
scanner software. Data analysis was done in three stages. First, 
expression intensities were calculated for each transcript probed 
on the array for all hybridizations using Illumina' s Beadstudio#2 
software. Secondly, intensity values were quality controlled and 
normalized. Quality control was carried out by using the Illumina 
Beadstudio detection p- value set to < 0.05 as a cutoff. This removed 
probes whose signals were too low to be reliably detected on the 
array. After this step, the initial -46,000 probes were reduced to 
22,319 (Data set 2A). Measurements were then normalized using 
the normalize, quantiles routine from the Affymetrix package in 
Bioconductor. This procedure accounted for any variation in hybridi- 
zation intensity between the individual arrays. An assessment of 
several different normalization techniques using the Bioconductor 
maCorrPlot routine suggested that normalize. quantiles was the 
most appropriate for the data. Finally, these normalized data were 
imported into GeneSpring and analyzed for differentially expressed 
genes. The raw datasets were submitted to the GEO database 
(accession number GSE47030). 



To identify genes differentially expressed between LH cells infected 
with Ad-GFP and Ad-GFP-NKX3.1 the biological repUcates for 
each time point (7 h and 10 h) were averaged. Datasets were interro- 
gated for genes with statistically significant differences between the 
two groups (i.e. +/- NKX3. 1) based on the results of the Welch t-test 
(parametric test, variances not assumed equal; p-value cutoff 0.05). 
To find the genes with the most robust changes in expression, the 
data was plotted as a "Volcano Plot" (Supplementary Figure S2B), 
which allows statistical significance to be measured along with 
the extent of fold change in expression. Lists of mRNAs signifi- 
cantly changing 3-fold or 5-fold upon expression of NKX3.1 were 
assembled (Data set 2C). 

RNA isolation and Q-PCR analysis 

LH cells were infected with 20 |il of Ad-GFP or Ad-GFP-NKX3.1 
viruses and total RNA was isolated after 6, 8, 10, and 12 h using 
the RNeasy mini kit (Qiagen, Valencia, CA). RNA concentrations 
were determined by measuring absorption at 260 nm in a spec- 
trophotometer. Aliquots of 2 jig of total RNA from each sample 
were reverse-transcribed into cDNA using an Omniscript RT kit 
(Qiagen) according to the manufacturer's instructions. Quantitative 
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Figure 2. NKX3.1 interacts with DNA repair proteins. (A) The list of NKX3.1 interacting proteins was loaded into Cytoscape and used to 
build Reactome Functional Interaction networks. The networks were clustered into modules (indicated by colors), and pathways enriched in 
the modules (FDR < 0.01) were identified. Diamonds represent network components that were not identified as NKX3.1 interacting proteins. 
(B) LNCaP cells were transfected with FLAG-NKX3.1 (+) or empty vector (-) followed by absorption of cell lysate to FLAG M2 resin to 
purify FLAG-NKX3.1. Co-purifying DNA repair proteins were detected by immunoblotting. The bottom four panels are from the same affinity 
purification resolved on a separate gel. The asterisk denotes an unspecific cross-reactivity of the HSPA8 antibody. Cropped blot images are 
shown; see Figure S7 for full images. (0) A nuclear protein fraction was prepared from LNCaP cells and employed for immunoprecipitation 
with NKX3.1 antibodies or an IgG control as indicated. The same samples before ("B") and after ("A") immunoprecipitation are shown to 
document the specific depletion of endogenous NKX3.1. The bottom three panels are from the same immunoprecipitate resolved on a 
separate gel. Cropped blot images are shown; see Supplement Figure S7 for full images. 
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Real-Time PCR was performed using Brilliant SYBR Green QPCR 
Master Mix (Stratagene, La JoUa, CA) and the Mx3000 Real-Time 
PCR System (Stratagene). Gene specific primers were designed 
using the PrimerS algorithm (http://frodo.wi.mit.edu/) as shown 
below. PCR reactions were run according to the protocol for the 
BrilUant SYBR Green QPCR Master Mix. Briefly, PCR was car- 
ried out using a final concentration of 0.2 |imol of the primer pairs, 
50 ng of cDNA template and 12.5 |il of Brilliant® SYBR Green 
QPCR Master Mix. The volume was adjusted to 25 ^il by adding 
RNase-free water. The thermocycling protocol began with a 3 min 
denaturation at 95°C, a 40 cycle amplification program consisting 
of 30 s denaturation at 95°C, 1 min annealing at 55°C and 30 s 
extension at 95 °C. Upon conversion of raw ct values to linearly 
related X(0) values, expression values were normalized to GAPDH, 
and expression changes were expressed as ratios of mRNA levels 
in NKX3.1 infected versus GFP infected cells (NKX3.1/GFP). The 
ratios were log2 transformed and averaged across two technical 
replicates, and standard deviations were calculated. 

Primer sequences used for Q-PCR: 

HSPA6_F CCGTGAAGCACGCAGTGAT 

HSPA6_R ACGAGCCGGTTGTCGAAGT 

TAGLN_F GCTGGAGGAGCGACTAGTGG 

TAGLN_R CCTCCTGCAGTTGGCTG 

CDH2_F TGGAACGCAGTGTACAGAATCAG 

CDH2_R TTGACTGAGGCGGGTGCTGAATT 

CCND2_F TACCTTCCGCAGTGCTCCTA 

CCND2_R TCACAGACCTCCAGCATCCA 

STAT2_F CACCAGCTTTACTCGCACAG 

STAT2_R TGGAAGAATAGCATGGTAGCCT 

EEF1A2_F GCTGAAGGAGAAGATTGACC 

EEF1A2_R TTCTCCACGTTCTTGATGAC 

CDKN1A_F TTGTCTTTCCTGGCACTAAC 

CDKN1A_R CCCTCGAGAGGTTTACAGTC 

HES 1_F GCATCTGAGCACAGAAAGTC 

HES 1_R CTGTCATTTCCAGAATGTCC 

S 100A2_F GGGAAATGAAGGAACTTCTG 

S 100A2_R CACATGACAGTGATGAGTGC 

TNFa_Fl GTGGACCTTAGGCCTTCCTC 

TNFa_Rl ATACCCCGGTCTCCCAAATA 

TNFa_F2 CCCAGGCAGTCAGATCATCTT 

TNFa_R2 TCTCAGCTCCACGCCATT 

Measurement of cell proliferation 

LH cells were seeded in 3 84- well plates at a density of 2000 cells 
per well. After 24 hours, cells were transduced with Ad-GFP- 
NKX3.1 or control Ad-GFP adenoviruses for the times indicated in 
Figure 6D-F. Proliferation (i.e. DNA synthesis) was measured using 
the Click-iT® EdU Alexa Fluor® 594 HCS kit (Invitrogen, Carlsbad, 
CA) according to the manufacturer's instructions. Briefly, 10 |iM 
5-ethynyl-2'-deoxyuridine (EdU) was added to culture media for 
one hour, and cells were fixed with 3.7% formaldehyde, washed 
with PBS twice, permeabiHzed with 0.1% Triton X-100 in PBS, 
stained with Click-iT Alexa Fluor 594 dye, and counter stained 
with 1 jig/mL Hoechst 33342 (Blue). Plates were scanned and ana- 
lyzed by using a Celigo automated cytometer at dual wave length 
to detect Hoechst dye (total cell count) and Alexa Fluor 594 (cells 



incorporating EdU and thus undergoing DNA synthesis). Four 
images per well were obtained at each wave length, and the per- 
centage of proliferating cells was calculated by dividing the number 
of Alexa positive cells by the total cell number. 

MAP kinase inhibitors and neutralizing antibodies were added two 
hours after viral transduction. JNK inhibitors SP600125 (EMD 
Chemicals Inc, San Diego, CA) and p38 inhibitor SB203580 (Enzo 
Life Sciences, Farmingdale, NY) were used at 20 |iM. Mouse IgG 
directed against TNFa (Clone 6401, R&D Systems, Minneapolis, 
MN) and whole mouse IgG as a control (Jackson ImmunoResearch 
Laboratories, West Grove, PA) were used at 5 |ig/ml. 

Pathway and network analysis 

Ingenuity Pathway Analysis (IPA, Ingenuity Systems) was used for 
pathway and network analysis. The bulk of the analysis was per- 
formed with the 5x dataset (mRNAs showing a significant > 5 -fold 
change upon expression of NKX3.1). The 3x dataset was used for 
the MYC network. Datasets were imported into IPA, and analyzed 
with the following settings: Reference Set: Ingenuity Knowledge 
Base (Genes + Endogenous Chemicals); Network Analysis: Direct 
and Indirect Relationships; Data Source: Ingenuity Expert Find- 
ings; Confidence: Experimentally Observed; Species: Mammal 
(human, mouse, rat) and Uncategorized (e.g. chemicals); Tissue 
and Cell Lines: All. 

NextBio analysis 

The 5x dataset was uploaded to the NextBio server through the 
Sanford-Burnham portal. 153 of the 158 features of the 5x dataset 
were recognized and could be interpreted by NextBio. The analysis 
was performed using default settings. Significantly enriched tran- 
scription factor binding sites were identified through correspond- 
ing Biogroups. The overlap between the 5x dataset and the gene 
expression study by Nanni et a/. was identified through a search 
against all curated studies. 

Indirect immunofluorescence staining 

Flag-NKX3.1 transfected LNCaP cells were seeded onto 15 mm 
poly-lysine coated glass cover slips, and fixed using formaldehyde 
(3.7% in PBS). Samples were stained with mouse monoclonal 
FLAG (Sigma) or goat polyclonal NKX3.1 antibodies (Santa Cruz). 
Alexa Fluor 568 (red) donkey anti-mouse IgG and Alexa Fluor 488 
(green) donkey anti-goat IgG conjugate antibodies (Life Tech- 
nologies Cat# A10037, RRID:AB_1 11 80865 and Cat# A11055, 
RRID:AB_ 105 64074) were used as secondary antibodies. The 
nuclei were stained with 4 '-6' diamidino-2-phenylindole (DAPI). 
Samples were imaged on a Nikon Type 120 inverted fluorescent 
microscope using 60x magnification. 

Results 

The NKX3.1 interactome 

Reasoning that the NKX3.1 interactome may be most effectively 
profiled in cells that naturally express this protein, we transiently 
expressed FLAG epitope-tagged NKX3.1 in LNCaP human pros- 
tate cancer cells. FLAG-NKX3.1 was approximately 5-fold in excess 
over endogenous NKX3.1 (Supplementary Figure SI A) but localized 
primarily to cell nuclei (Supplementary Figure SIB). The proteas- 
ome inhibitor MG132 was added 4 hours prior to lysate preparation 
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in order to slow the rapid clearance via the ubiquitin-proteasome 
pathway to which NKX3.1 is normally subjected^^'^\ Cell lysate 
was absorbed to anti-FLAG M2 resin, and specifically retained 
proteins were eluted with FLAG peptide. Four independent affin- 
ity purifications were performed in parallel with mock purifications 
of lysate of cells transfected with empty vector. The eluates were 
examined by SDS-PAGE (Figure lA) and subjected to LC-MS/ 
MS analysis in order to determine their protein composition. Alto- 
gether, 315 proteins were identified at a false-positive rate of < 0.01 
(Data set lA). 

The protein dataset was subjected to background subtraction and 
abundance-based filtering to arrive at a list of 58 high confidence 
NKX3.1 interacting proteins (see Materials and methods and Data 
set IB). Fifty five of the 58 proteins were identified in at least two 
independent purifications, and 27 were identified in at least three 
purifications (Figure IB, Data set IC). Five proteins were consist- 
ently identified as NKX3.1 interaction partners in all four inde- 
pendent purifications, namely NKX3.1, the DNA repair proteins 
XRCC5/Ku80 and PARPl, and the protein synthesis proteins RPS9 
andPABPCl. 

We next performed a relative quantification of the NKX3.1 inter- 
actome based on spectral counting-*^. Upon summing the molecular 
weight adjusted spectrum counts of each protein across the four 
mock and NKX3.1 purifications, we derived background corrected 
quantifications by either subtracting summed mock values from 
summed NKX3.1 bait values (NKX3.1 - Mock) or by dividing 
NKX3.1 bait values from mock values (NKX3.1/Mock) to obtain 
the factor by which a protein was enriched in the NKX3.1 bait 
samples over the mock sample. Both methods confirmed the 
expectation that NKX3.1 was the most abundant protein identi- 
fied in the FLAG affinity purifications (Figure IC, D). We also 
performed Reactome Functional Interaction analysis to construct a 
functional interaction network of NKX3.1 binding proteins derived 
from manually curated literature data^-. The network was clustered 
into modules and enriched functional pathways/reactions were 
identified (Figure 2A). 

Among the 10 most abundant co-purifying proteins were the 
components of the DNA-dependent protein kinase (DNA-PK) 
holoenzyme, XRCC5/Ku80, XRCC6/Ku70, and poly(ADP) ribose 
polymerase (PARPl) (Figure 2A). DNA-PK and PARPl have 
important functions in DNA double strand break repair, recombina- 
tion, and telomere maintenance but are also involved in chromatin 
and transcriptional controP^"^^ For example, Ku proteins associate 
with a series of homeodomain proteins (H0XC4, OCTl, 0CT2, 
DLX2) thereby recruiting them to DNA ends where they are phos- 
phorylated by DNA-PK^'^ Such phosphorylation was proposed to 
lead to DNA damage-dependent changes in their transcriptional 
activities. ADP-ribosylation mediated by PARPl can stimulate 
the ability of DNA-PK to phosphorylate protein substrates^". 
Our interactome data provide a possible mechanism underlying 
the previously observed localization of NKX3.1 to sites of DNA 
damage-^, although the functional consequences of these interac- 
tions for NKX3.1 transcriptional activity remain to be established. 
Regardless, follow-up co-immunoprecipitation experiments showed 
that overexpressed NKX3.1 readily interacted with endogenous 
XRCC5/Ku80, XRCC6/Ku70, and PARPl (Figure 2B). Interaction 



of DNA-PK with ectopically expressed NKX3.1 was very recently 
reported in an independent study We show here that endogenous 
NKX3.1 also interacts with XRCC5/Ku80, XRCC6/Ku70, and 
PARPl (Figure 2C). 

Among the top ranking NKX3.1 interacting proteins was also inter- 
leukin enhancer binding factor 2 (ILF2/NFAT 45 kDa) (Figure ID). 
This protein was previously shown to interact with the DNA-PK- Ku 
complex^- and to be part of a ribonucleoprotein assembly con- 
taining heterogeneous nuclear ribonucleoproteins (hnRNPs), the 
heat shock protein HSPA8, the poly-A binding protein PABCl, 
nucleolin (NCL), and several ribosomal proteins^\ all of which 
were also identified here as components of the NKX3.1 interac- 
tome (Figure 1C,D, Data set lA). Most of these interactions were 
also represented in the Reactome network (Figure 2A). Two addi- 
tional subunits of this particle, ILF3 and YBXl were also identi- 
fied, albeit at low levels (Data set 1 A). hnRNPs function in multiple 
processes, including mRNA splicing, dynamics, stability, and 
translation, telomere maintenance, DNA repair, and chromatin remod- 
eling and transcription^^. They are also major constituents of the 
nucleolar proteome, which additionally comprises many of the 
NKX3.1 interacting proteins listed above, including the DNA-PK 
complex, PARPl, HSPA8, and ribosomal proteins as well as the 
RNA heUcases DDX3 and DDX5^^'^^ Although the significance of 
these interactions remains unclear, they may reflect a close physi- 
cal coupling of NKX3.1 -dependent mRNA transcription to mRNA 
processings^ and/or hitherto unappreciated role for NKX3.1 in 
nucleolar ribosome biogenesis and cytoplasmic mRNA transport. 
A similar proposition was made to rationalize the interactome of the 
transcription factor S0X2, which shares remarkable overlap with 
the NKX3.1 interactome^l 

Another highly abundant NKX3.1 interactor is the chromatin and 
nuclear assembly regulator BANFl (Figure ID). This interaction 
was confirmed by co-immunoprecipitation (Figure 2B). BANFl 
was previously shown to bind two other proteins identified in the 
NKX3.1 interactome, emerin (EMD) and thymopoetin (TMPO)^^. 
In addition, BANFl interacts with several other homeodomain tran- 
scription factors and regulates the transcriptional activity of one of 
them, CRX''". It is thus likely that BANFl, in complex with emerin 
and thymopoetin, is involved in NKX3.1 -mediated gene regula- 
tion. The nuclear matrix attachment proteins SAFA/HNRNPU and 
SAFB, which were also identified as NKX3.1 interacting proteins, 
may also participate in this process. 

Finally, we identified an interaction of NKX3.1 with the home- 
obox transcription factor H0XB13 (Data set IC). This interaction 
was confirmed by co-immunoprecipitation (Figure 2A). H0XB13 
also interacts with the androgen receptor and regulates the cellular 
response to androgen'' ^ In addition, germline mutations of HOXB 13 
significantly increase risk of hereditary prostate cancer through 
unknown mechanisms^^. However, further studies discounted the 
intriguing possibility that mutation of HOXB 13 alters its interaction 
with NKX3.1 (CCY & DAW, unpubhshed observation). 

NKX3.1 -induced transcriptional program 

Previous determinations of NKX3.1 -dependent gene expression 
signatures have profiled prostates of mice that developed and aged 
in the complete absence of NKX3.1^^''^'^'-°. These signatures may 
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therefore describe adaptive changes that occur in response to long- 
term depletion of NKX3.1 in addition to its immediate effects on 
gene expression. We have therefore chosen to acutely introduce 
NKX3.1 into immortalized human prostate epithelial cells (LH 
cells^^) that do not express detectable levels of NKX3.1 protein 
(data not shown). We produced adenoviruses driving the expres- 
sion of either GFP alone or GFP and NKX3.1 from separate 
promoters (Ad-GFP and Ad-GFP-NKX3.1 viruses, respectively). 
LH cells were infected with these viruses according to the scheme 
in Figure 3A. GFP signal became first detectable by live cell 
fluorescence microscopy 6 hours after infection (data not shown). 
We therefore harvested duplicate cultures of cells for immunoblot- 
ting 7 and 10 hours after infection and determined that NKX3.1 and 
GFP were expressed at both time points (Figure 3B). No cytopathic 
effects of adenovirus infection were observed within the time frame 



of the experiment. In parallel, we prepared duplicate RNA samples 
of the 7 hours and 10 hours time points for transcriptome analysis. 

The global changes in transcript levels noted in response to NKX3. 1 
expression were very similar at the 7 hours or 10 hours time points 
(Supplementary Figure S2A). Statistically significant changes were 
observed for several hundred mRNAs. To reduce the number of 
mRNA changes to be further interrogated to a manageable number, 
we arbitrarily set a cut-off of 5-fold change. This yielded lists of 
158 differentially expressed genes for the 7 hours time point (Sup- 
plementary Figure S2B) and 165 for the 10 hours time point. Since 
there was a considerable overlap of both lists, we limited the fur- 
ther analysis to the 7 hours sample. Data sets 2A and B summarize 
all mRNA expression data. Supplementary Table 1 presents a 
ranked list of all 107 mRNAs with > 5 -fold upregulation, whereas 
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Figure 3. Adenovirus-mediated expression of NKX3.1 in LH prostate epithelial cells regulates specific mRNAs. (A) Schematic 
representation of the time course of the experiment. LH cells were infected in duplicate with adenoviruses driving the expression of either 
GFP alone or GFP and NKX3.1 from two separate promoters. GFP expression became first apparent by fluorescence microscopy 6 hours 
after transfection (data not shown). (B) Duplicate cell lysates were prepared 7 and 1 0 hours after infection, and examined for the expression of 
GFP and NKX3.1 by immunoblotting. NKX3.1 expression was already detectable at the earliest time point (7 hours). (C) Quantitative RT-PCR 
analysis of 9 mRNAs whose expression is changed in response to NKX3.1. LH cells were infected with adenoviruses driving the expression 
of either GFP alone or GFP and NKX3. 1 , and mRNA was isolated after the indicated time points (6, 8, 10, 12 hours). The RNA samples were 
analyzed by Q-PCR, and expression values are shown as log2 transformed ratios of the mRNA level in NKX3.1 infected versus GFP infected 
cells (NKX3.1/GFP). Error bars indicate standard deviations obtained from two replicate measurements. The left panel shows data for 5 
mRNAs that were upregulated by NKX3.1 in the array dataset, whereas the right panel shows data for four mRNAs that were downregulated. 
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Supplementary Table 2 presents a corresponding list of all 51 
mRNAs with > 5-fold downregulation in NKX3.1 expressing LH 
cells (see also Data set 2C). We chose 5 upregulated and 5 down- 
regulated mRNAs for validation by Q-PCR with a fresh set of rep- 
licate RNA samples prepared from cells infected with Ad-GFP or 
Ad-GFP-NKX3.1 for increasing periods of time. Nine out of the 
10 expression changes confirmed the tendency seen from microar- 
rays, although variability was substantial for some measurements 
(Figure 3C). We failed to confirm the induction of KRT17 mRNA 
apparent from the array data (not shown). Additional validation by 
Q-PCR and immunoblotting is shown in various sections below (see 
Figure 6). 

Examination of the lists of mRNA changes revealed a fundamental 
reprogramming of gene expression in LH cells upon acute expres- 
sion of NKX3.1. Overall, the changes were indicative of inhibition 
of cell proliferation and induction of cell differentiation. For exam- 
ple, 9 epithelial differentiation markers (cytokeratins 5, 6B, 7, 8, 17, 
18, 19, stratifin, kallikrein 5) were strongly induced. In addition, the 
Notch pathway, which is often downregulated in prostate cancers^ ', 
was induced (DLLl, HESl, JAG2). The cyclin-dependent kinase 
inhibitor p21 (CDKNIA), which inhibits cell cycle progression and 
induces cell differentiation^^, was also increased. 

Reassuringly, many of the strongest NKX3.1 -induced mRNAs 
encode proteins that were previously shown to be downregulated 
in human prostate cancer based on immunohistochemistry (Supple- 
mentary Table 1). This included, for example, the calcium binding 
proteins S100A2 and A14^\ the 14-3-3 protein stratifin^^^'^^ laminin 
A^^ claudin 7^*^, prostasin^'^\ P cadherin^'^ and kallikrein 5^'^. Cyc- 
lin D2 is considered an activator of cell cycle progression but was 
induced by NKX3.1. Remarkably, however, cyclin D2 is typically 
downregulated in human prostate cancers^''. Four mRNAs encoding 
HSP70s were upregulated (Supplementary Table 1). HSP70 expres- 
sion is frequently lost in aggressive prostate cancers^^ and experi- 
mental HSP70 overexpression inhibits the tumorigenicity of prostate 
cancer xenografts in mice^'''. Likewise, three genes encoding the 
HSP70 co-chaperones DnaJ/HSP40 were upregulated > 5-fold. 
Lastly, two glutathione transferases were upregulated by NKX3.1, 
a finding that is consistent with the previous demonstration that 
NKX3.1 upregulates oxidative stress defense-". 

The list of downregulated genes (Supplementary Table 2) included 
genes involved in cell migration (actin/myo sin-related, collagens 
lAl, 5A1, 5A2), several growth factors, and the interferon/STAT 
pathway. Many of the most downregulated genes were previously 
shown to be overexpressed in prostate and other cancers (Supple- 
mentary Table 2). This applies, for example, to eukaryotic trans- 
lation elongation factor 1 alpha (EEF1A2) which is a potential 
oncogene^'^, the BMP antagonist gremlin and the transcription 
factor FOXDl^'l N-cadherin, which is frequently found to replace 
epithelial cadherin forms in prostate cancers ("cadherin switch") 
was also strongly downregulated^''^. Significantly, NKX3.1 also 
upregulated P cadherin thus reversing the cadherin switch. 

We also compared our list of 331 mRNAs that were changed 
> 3-fold by NKX3.1 with a recent list of 282 mouse genes thought 



to be direct NKX3.1 targets based on a combination of expression 
and ChlP-seq data^^\ Despite the species difference and the diamet- 
rical strategies (overexpression versus knockout), 10 genes were 
represented on both lists (Supplementary Table 3). This overlap is 
highly significant when considering that 8 out of these 10 genes 
were regulated by NKX3.1 in the same direction. 

Pathway analysis 

To assess functional modules and signaling pathways affected by 
NKX3.1, we performed a global analysis with the Ingenuity Path- 
way Analysis (IPA) package. The analysis was performed with the 
dataset of mRNAs changing more than 5-fold ("5x dataset") or, 
where indicated, with a larger dataset of mRNAs changing more 
than 3-fold ("3x dataset", 357 genes). Since identical top scoring 
pathways were obtained with both datasets, the analysis was largely 
restricted to the smaller 5x dataset, unless otherwise noted. 

Consistent with the involvement of NKX3.1 in prostate develop- 
ment, we found highly significant overrepresentation of IPA "Func- 
tions" pertaining to development, cell movement, proliferation and 
cell growth (Figure 4A). Of particular interest was the term "Repro- 
ductive Systems Disease", which included the subgroup "Prostatic 
intraepithelial neoplasia" (PIN). PIN is the earliest known precursor 
lesion of prostate cancer, and frequently shows decreased NKX3.1 
levels^". The "PIN" Function contained the seven genes listed in 
Figure 4B. A previous study determined that six of these genes were 
downregulated in PIN versus normal prostate, whereas one was 
upregulated^ ^ Remarkably, five out of the seven genes displayed 
a mirror image of the changes occurring in PIN when examined in 
NKX3.1 -expressing LH cells (Figure 4B). These findings suggest 
that changes in gene expression in early PIN may be causally linked 
to loss ofNKX3.1. 

As shown in Figure 4C, a number of pathways were overrepre- 
sented that were not readily apparent from the manual curation of 
the gene lists presented above. For example, the analysis indicated 
upregulation by NKX3.1 of the p53 and ILl pathways, in addition 
to the Notch signaling pathway. Interferon signaling, in turn, 
appeared to be switched off by acute NKX3.1 expression. 

Network analysis 

TNFa network. To obtain a better understanding of the regulatory 
circuitry underlying NKX3.1 -induced modulation of particular func- 
tional pathways, we performed network analysis using Ingenuity 
IPA software. The highest ranking network presented in Figure 5A 
featured TNFa, a gene that was induced by NKX3.1 (Supplemen- 
tary Table 1, Figure 6A), in the center with edges reaching to 27 
distinct nodes. Eighteen of these edges were defined by a gene regu- 
latory relationship (i.e. expression edge) thus signifying genes that 
are known to be either induced or suppressed by TNFa signaling. 
Further annotation of the TNFa network also connected TNFa to 
NKX3.1 -induced suppression of cell movement through downregu- 
lation of action-myosin based mobility components and enhance- 
ment of cell adhesion through upregulation of laminins (Figure 5A). 
Both processes are considered bona fide hallmarks of tumor sup- 
pression. Close examination of every TNFa expression edge 
revealed considerable concordance between the definition of the 
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Figure 4. Functions and pathways that are overrepresented in the NKX3.1 gene expression program. (A) Select IPA "Functions" 
significantly overrepresented in the 5x mRNA set. (B) List of mRNAs with inverse expression in prostatic intraepithelial neoplasia (PIN;^^) 
and NKX3.1 expressing LH cells. mRNAs shown in red are upregulated whereas those shown in green are downregulated. (C) Select IPA 
"Canonical Pathways" overrepresented in the 5x dataset. The abscissa on the top indicates the percent fraction of all possible pathway 
components that were represented in the dataset. Since this dataset only contained a relatively small number of 158 mRNAs, a small percent 
wise overrepresentation of pathway components is statistically highly significant (p < 0.05, see yellow graph). 



edge (based on the published Uterature) and the actual expression 
of the target node in response to NKX3.1. Fourteen first degree 
nodes predicted to be activated by TNFa were also upregulated by 
NKX3.1 (Supplementary Table 4). Consistent with MAP kinase 



signaling being a major downstream pathway activated by TNFa, 
we found that a chemical inhibitor of JNK but not p38 could par- 
tially antagonize NKX3.1 -induced expression of HSPA6 and HESl 
(Figure 6B). 
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Figure 5. IPA network analysis of the NKX3.1 transcriptional program. (A) TNFa network. Node colors represent the level of up- (red) or 
down- (green) regulation upon expression of NKX3.1 . (B) Tumor suppressor p53 network. The p53-TERT-EGF-JUN quadrangle is highlighted 
by dark blue edges. (C) MYC network. First degree edges of MYC are highlighted in light blue. (D) PDGFB/TGFp network. First degree edges 
are highlighted in light blue, the PDFGB-TGF(3 link in dark blue. 



p53 network. Another high scoring network featured the tumor 
suppressor p53 at the center with first degree edges to 8 nodes. 
Although p53 was upregulated neither at the mRNA nor protein 
level (Figure 6C), a finding which is consistent with the well- 
established activation of p53 at the post-translational level, the net- 
work indicated robust induction of some of its known target genes. 
As shown in Figure 5B, this included the 14-3-3 sigma protein strat- 
ifin (SFN), an epithelial differentiation marker missing from many 
prostate cancers^^''^% the cyclin-dependent kinase inhibitor p21 (CD- 
KNIAJ'), and the p53 apoptosis effecter PERP^^ Induction of p21 
protein by NKX3.1 was confirmed by immunoblotting (Figure 6C). 
Annexin A8 (ANXA8) is also known to be upregulated by p53^^ 



Using the 3x dataset, we pinpointed an additional 7 mRNAs that are 
upregulated by NKX3.1 as known targets of p53 (Supplementary 
Figure S3). These findings suggested that the p53 tumor suppressor 
pathway is activated by acute induction of NKX3.1 in LH cells. The 
network contained three additional highly connected nodes, telom- 
erase (TERT), EGF, and JUN, which formed a quadrangle with p53. 
Although JUN mRNA was not induced by NKX3.1, a positive 
effect of p53 on JUN was reported previously 

MYC network. A further high scoring network that was obtained with 
the 3x dataset was organized around the MYC oncogene (Figure 5C). 
MYC itself was 4-fold downregulated by NKX3.1 expression. 
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Figure 6. NKX3.1 -induced changes in gene and protein expression. (A) Quantitative RT-PCR analysis of TNFa mRNA. LH cells were 
infected with adenoviruses driving the expression of either GFP alone or GPP and NKX3.1, and mRNA was isolated after the indicated time 
points (6, 8, 10, 12 hours). The RNA samples were analyzed by Q-PCR with two different primer sets amplifying TNFa mRNA, and expression 
values are shown as log2 transformed ratios of the mRNA level in NKX3.1 infected versus GFP infected cells (NKX3.1/GFP). Error bars 
indicate standard deviations obtained from two replicate measurements. (B) LH cells were infected with adenoviruses driving the expression 
of either GFP alone or GFP and NKX3.1. After 4 hours, 10 pM of the JNK inhibitor SP600125 or 10 pM of the p38 kinase inhibitor SB203580 
were added followed by mRNA isolation after 6 hours. The levels of HSPA6 and HES1 were analyzed by Q-PCR. Expression values are 
shown as log2 transformed ratios of the mRNA level in NKX3.1 infected versus GFP infected cells (NKX3.1/GFP). Error bars indicate standard 
deviations obtained from two replicate measurements. (C) LH cells were infected with adenoviruses driving the expression of either GFP alone 
or GFP and NKX3.1, and protein lysates were prepared after the indicated time points (6, 8, 10, 12 hours). The expression of the indicated 
proteins was determined by immunoblotting. Cropped blot images are shown; see Figure S8. for full images. (D) LH cells were infected with 
Ad-GFP and Ad-GFP-NKX3.1 viruses, and the rate of DNA synthesis was measured by EdU incorporation after the indicated times (top 
graphs). The percentage of GFP positive cells was determined as a measure of infection efficiency (bottom graphs). (E) LH cells were 
infected with Ad-GFP-NKX3.1 virus, and the effect of JNK inhibitor (SP600125, 20 pM) or p38 kinase inhibitor (SB203580, 20 pM) on NKX3.1- 
mediated suppression of DNA synthesis was measured by EdU incorporation. The percentage of GFP positive cells was determined as a 
measure of infection efficiency (bottom graphs). (F) LH cells were infected with Ad-GFP-NKX3.1 virus, and the effect of neutralizing antibodies 
to TNFa or control IgG on NKX3.1 -mediated suppression of DNA synthesis was measured by EdU incorporation. The percentage of GFP 
positive cells was determined as a measure of infection efficiency (bottom graphs). 



an effect that was validated by immunoblotting (Figure 6C). This 
coincided with downregulation of several genes that were previ- 
ously found to require MYC function for their expression (TXNIP, 
IFI16^^). In addition, the MYC interaction partner PARPIO was 
downregulated upon expression of NKX3.L Conversely, two genes 
that are negatively regulated by MYC were activated upon NKX3.1 
expression (PERP^^ NDRG''^), suggesting that NKX3.1 -induced 
downregulation of MYC relieves its repressive effect on these genes. 
In aggregate, these findings suggest that restoration of NKX3.1 



expression in LH cells led to downregulation of pathways normally 
turned on by MYC. This may contribute to a block in prolifera- 
tion and promote cell differentiation by NKX3.1. Antagonism of 
NKX3.1 and MYC in target gene regulation and prostate tumori- 
genesis was recently also demonstrated in a mouse modeF^. 

PDGFB/TGFP network. Another network featured PDGPp 
(PDGFB and PDFGBB), which was induced 5.1-fold by NKX3.1. 
The induction of PDGFB mRNA and the expression of many of its 
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first degree interacting nodes, is consistent with PDGFB signaling 
being upregulated by NKX3.L For example, three nodes that were 
upregulated by NKX3.1 (CRYAB, SERPINA3, CDKNIA) and two 
nodes that were downregulated (DAB2, TAGLN) were previously 
shown to be controlled by PDGFB in the same manner (Supple- 
mentary Figure S4;^°'^0. PDGFB is also known to activate PPAR/ 
RXRa-dependent transcription. Notably, RXRa is itself upregulat- 
ed by NKX3.1 (5.7-fold), hence explaining the overrepresentation of 
PPAR signaling in the canonical pathway analysis above (Figure 4C). 
Since PPAR signaling is known to suppress prostate cancer cell 
proliferation^^, it may be relevant to NKX3.1 -mediated tumor 
suppression. 

PDGFB shares a number of nodes with another growth factor, 
TGFP (Figure 5D). Although TGF(31 mRNA was not altered by 
NKX3.1, the more abundantly expressed TGFp2 was downregu- 
lated (Supplementary Table 5). Most first-degree nodes emanating 
from TGFp were downregulated by NKX3.1 expression (Supple- 
mentary Figure 3). An additional 25 genes in the TGFp signaling 
pathway were either downregulated or unchanged by NKX3.1, 
further suggesting that NKX3.1 does not activate TGFp signal- 
ing (Supplementary Table 5). Since TGFp is a strong driver of the 
epithelial-to-mesenchymal transition (EMT,^^), NKX3.1 -mediated 
suppression of TGFp signaling may contribute to its differentiation- 
inducing activity. 

Network connectivity 

In an attempt to obtain a more cohesive view of the global effects 
of NKX3.1 on prostate gene expression, we merged individual 
networks. For simplicity, only expression edges were included in 
Figure 7 A. Not only were TNFa and p53 directly linked through an 
expression-based edge, but several of their individual first degree 
nodes were targets of edges emanating from both TNFa and p53. 



For example, TFP12 and CASP4 are positively regulated by both 
TNFa and by p53^6'^'^-^^ 

The API transcription factor subunit JUN, which was part of the 
p53 network (Figure 5B) was linked to TNFa resulting in a tri- 
angular configuration (Figure 7A). Whereas both TNFa and p53 
are known to stimulate the expression of JUN and API activity 
NKX3.1 expression did not significantly affect the mRNA level 
of cJUN (-1.21-fold change) or JUND (+1.25-fold change). How- 
ever, the JUN interaction partner FOS was increased 3.9-fold by 
NKX3.1. Since FOS maintains exactly the same edges within the 
network as JUN (data not shown), API transcriptional activity 
appears to be upregulated in response to NKX3.1 expression. 

Finally, we manually integrated the TNFa network with the con- 
nections to all major factors the network analysis had implicated in 
the NKX3.1 transcriptional program, including FOS/APl, MYC, 
and p53. Despite the complexity of the resulting network, a ten- 
tative framework for NKX3.1 -induced transcriptomic changes is 
becoming readily apparent (Figure 7B, C). According to this frame- 
work, NKX3.1 expression in LH cells results in the activation of the 
TNFa pathway. This in turn leads to activation of the p53. Notch, 
PDGFB, and API pathways. Conversely, the MYC and interferon/ 
STAT pathways are turned off. Through Q-PCR and immunoblot- 
ting, we have already confirmed several of these predictions (see 
Figure 3C for p53. Notch, PDGFB, STAT, and Figure 6 for TNFa, 
MYC, and p53). In addition, transduction with NKX3.1 expressing 
virus led to growth inhibition of LH cells relative to virus express- 
ing GFP alone (Figure 6D). Notably, growth inhibition was partially 
rescued by JNK inhibitor and by a neutralizing antibody against 
TNFa (Figure 6E, F). These observations further support a role of 
NKX3.1 in inducing a block to cell division and promoting cell dif- 
ferentiation via a TNFa/JNK/APl -dependent pathways. 




Figure 7. Framework of the NKX3.1 transcriptional program. (A) The merged TNFa-p53 network. Network links are highlighted in yellow. 
Direct edges between TNFa, p53, and JUN are emphasized in blue color. (B) Construction of a network containing the major factors 
implicated in the NKX3.1 transcriptional program, including F0S/AP1, MYC, and p53. Modules activated by NKX3.1 expression are shaded 
in red and those suppressed in green. (C) Tentative framework of NKX3.1 -dependent changes to cellular modules. Based on the induction of 
TNFa and FOS mRNA by NKX3.1 , and the antagonistic effects of JNK inhibitors on NKX3.1 -mediated gene expression and cell proliferation, 
the framework proposes that TNFa signaling results in activation of AP1 and modulation of downstream genes and functional modules (red 
squares symbolize upregulation/activation, green squares downregulation). Additional pathways (stippled lines) may impinge on SRF and 
other transcription factors (not shown). 



Page 14 of 28 



FIOOOResearch 2014, 3:1 15 Last updated: 21 AUG 2014 



Enrichment of transcription factor binding sites 

We next employed the NextBio platform to relate our expression 
data to previously published large-scale genomics data. One data- 
set that matched with high statistical significance (p = 4.5E-11) 
featured a set of 1082 genes containing evolutionarily conserved 
genomic binding sites for APPl Twenty six of these genes were 
represented in our list of -150 NKX3.1 responsive genes with 20 
being induced by NKX3.1 (Supplementary Table 1, Supplementary 
Table 2, Supplementary Figure 5 A, Data set 2D). Combined with the 
evidence from network analysis and the upregulation of FOS, these 
findings suggest that NKX3.1 causes API activation and/or cooper- 
ates with API in gene activation. Consistent with this conjecture is 
the well-known induction of JUN N-terminal kinase (JNK) activity 
by TNFa signaling, which enhances the transcriptional activity of 
JUN. Finally, NFkB which is also induced by TNFa signaling, can 
cooperate with API at some promoters^"^. 

A second DNA binding motif that was overrepresented (p = 1.6E"^) 
in NKX3.1 responsive genes conforms to a binding site for serum 
response factor (SRF). 216 human genes contain the serum response 
element (SRE) motif in a promoter proximal context that is con- 
served in mouse, rat, and dog^l These 216 genes included 9 genes 
that were represented on our dataset, all but one of which was 
suppressed by NKX3.1 (Supplementary Table 2, Supplementary 
Figure 5B, Data set 2E). Since NKX3.1 is known to physically 
interact with SRF^^ our data strongly suggests that NKX3.1 coop- 
erates with SRF in transcriptional suppression. 

Comparison with human prostate cancer data 

Nextbio analysis also revealed a highly significant match with a 
study comparing gene expression in human prostate cancer tis- 
sues^^ This study profiled 22 cell lines derived from surgical sam- 
ples of prostate cancer patients with clinically localized disease 
and absence of hormonal neo-adjuvant treatment before surgery. 
In keeping with these selection criteria for early cancers, the cell 
lines (and primary tumors they were derived from) had suffered 
loss of 8p21 (i.e. NKX3.1) but did not display genetic abnormali- 
ties typical of more advanced prostate cancers (e.g. loss of PTEN, 
amplification of MYC and androgen receptor). 3415 mRNAs were 
significantly changed in prostate cancer cell lines relative to normal 
prostate. 

Of 153 differentially expressed genes in our dataset, 82 (53%) 
were also changed in prostate cancer derived cell lines (PCaDCL), 
a highly significant overlap (p = 2.0E"^^, Supplementary Figure 6; 
Data set 2F). Of the 82 overlapping genes, 60 were downregulated 
and 22 were upregulated in PCaDCL versus PrEC. Strikingly, 93% 
of the mRNAs downregulated in PCaDCL were induced by expres- 
sion of NKX3.1 in LH cells (Supplementary Table 1). In addition, 
19 of the 20 genes upregulated in PCaDCL were downregulated by 
NKX3.1 (Supplementary Table 2). Moreover, many of the mRNA 
expression changes observed in the PCaDCL microarray study 
were independently confirmed at the protein level by immuno- 
histochemistry of prostate cancer tissue samples (Supplementary 
Table 1 and Supplementary Table 2). These analyses strongly sug- 
gest that the principal gene regulatory networks that are affected 



by NKX3.1 expression in LH cells are inversely perturbed in early 
human prostate cancer marked by loss of this tumor suppressor. 



NKX3.1 expression and interactions Dataset 

9 Data Files 

http://dx.doi.org/10.6084/m9.figshare.1002064 

V J 

Discussion 

We have employed a series of global approaches to explore the 
tumor suppressor function of NKX3.1. The NKX3.1 interactome 
revealed a complex pattern of interactions with DNA repair proteins 
and with other transcriptional regulators such as ILF2 and BANFl 
that predict a similarly complex transcriptional program enacted by 
NKX3.1. Indeed, global analysis of the gene expression pattern 
actuated by acute expression of NKX3.1 in immortalized human 
prostate epithelial cells with a basal phenotype (LH cells-^ '^^^ revealed 
a rapid and extensive re-programming with 158 mRNAs changing 
> 5-fold and 331 mRNAs changing > 3-fold. This complex pattern 
was interrogated by network analysis to account for the recogni- 
tion that representation of cellular processes and reactions as linear 
pathways is often an oversimplification that does not accurately 
reflect the complexity of intracellular wiring'^ ^ 

Network analysis indicated NKX3.1 -dependent modulation of a 
series of interconnected functional modules and enabled a tentative 
framework for the transcriptional program induced by NKX3.1 in 
human prostate epithelial cells. Broadly speaking, NKX3.1 activa- 
tion culminates in the downregulation of cellular motility as well 
as MYC and IFN/STAT activity and in the upregulation of p53 
activity, the Notch pathway, and PDGF signaling (Figure 7C). 
Many of these changes are readily consistent with the tumor sup- 
pressor function of NKX3.1 observed in knockout mice^"^ 

Importantly, network analysis allowed us to pinpoint several unan- 
ticipated pathways on which NKX3. 1 appears to impinge. For exam- 
ple, the analysis suggested a major role for TNFa whose mRNA 
was induced by NKX3.1. TNFa is a well-established inducer of 
MAP kinase signaling, including JNK and p38 kinases. Signifi- 
cantly, ILla was also induced by NKX3.1 (Supplementary Table 1) 
thus further augmenting MAPK activation. JNK activates API 
transcriptional activity thus readily rationalizing the strong over- 
representation of API binding sites in NKX3.1 responsive genes. 
Localized NKX3.1 -mediated TNFa- JNK signaling in prostate 
epithelial cells may promote and maintain their differentiation 
state thus suppressing tumorigenesis. The important role of JNK 
signaling in cell differentiation is well established'^^'^l The finding 
that pro-inflammatory cytokines also destabilize NKX3.1 protein^^ 
indicates a negative feedback loop that may counteract their pro- 
apoptotic function (Figure 7C). 

Importantly, the NKX3.1 -induced gene signature is, to a large extent, 
a mirror image of the gene expression pattern found in early human 
prostate cancers devoid of NKX3.1^\ This inverse pattern further 
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suggests that NKX3.1 is a key driver of luminal cell differentiation, 
whereas loss of NKX3.1 would allow luminal cells to dedifferenti- 
ate into a state with higher proliferative capacity thus making them 
more vulnerable to the acquisition of additional oncogenic events 
perhaps augmented by concurrent defects in DNA repair. Clearly 
such additional events are essential for prostate carcinogenesis 
given that PIN in NKX3.1 knockout mice does not progresses to 
overt prostate cancer, unless further genetic changes are incurred^ l 

Data availability 

figshare: NKX3.1 expression and interactions Dataset. Doi: 
10.6084/m9.figshare. 1002064^^4 
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Figure S1 . Transfection of FLAG-NKX3.1 expression into LNCaP cells and affinity purification. (A) LNCaP cells were transfected with 
pFLAG-NKX3.1 plasmid or with the empty pFLAG vector. Total cell lysate (lanes 1 and 2) was absorbed to anti-FLAG resin and eluted with 
FLAG peptides (lanes 5 and 6). The depleted cell lysate after affinity purification is shown in lanes 3 and 4. Immunoblots were probed with 
the indicated antibodies. The blot with NKX3.1 shows the overexpressed FLAG-NKX3.1 and the endogenous NKX3.1 protein (middle panel). 
Actin was used as loading reference. Cropped blot images are shown; see Figure S9 for full images. (B) LNCaP cells were transfected with 
pFLAG-NKX3.1 plasmid, and FLAG-NKX3.1 was detected by indirect immunofluorescence staining with FLAG or NKX3.1 antibodies. 
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log2 Fold Change 



Figure S2. Global gene expression signature of NKX3.1 expression in LH cells. (A) Differential gene expression 7 and 10 li after NKX3.1 
expression in LH cells. Note the overall similarity of gene expression differences between GFP and NKX3.1 expressing LH cells at both time 
points (7 h and 10 h). (B) "Volcano Plot" of differentially expressed genes at the 7 h time point. Features marked in red differed significantly 
5-fold between GFP and NKX3.1 expressing samples. 
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Figure S3. p53-linked expression changes. IPA-based rendering of mRNAs contained in the 5x datasets that were previously shown to be 
regulated by p53. 
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^ERPINA3^ 




Figure S4. Schematic depiction of PDGFB and TGPp expression edges according to IPA and comparison with the actual behavior 
of first degree nodes in response to NKX3.1 expression in LH cells. Green color indicates upregulation, whereas red color signifies 
downregulation. The arrows represent the expression edges. Solid arrows indicate agreement between observed expression behavior and 
the behavior expected in response to activation of PDFGB or TGF(3 according to the information contained in the IPA database. The stippled 
arrows indicate disagreement. Example: PDGF is expected to upregulate HES1 . Induction of PDGF by NKX3.1 is therefore consistent with the 
change in HES1 mRNA (edge is solid red arrow). PDGFB is also expected to upregulate THBS1 (red edge), but NKX3.1 expression leads to 
suppression of THBS1 . Hence the edge is a stippled arrow. 
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Figure S5. Enrichment of putative conserved AP1 and SRF binding sites in genes affected by NKX3.1 expression in LH cells as 
determined with NextBio. (A) The top panels summarize the datsets: Bioset 1 = 5x dataset of mRNAs affected by NKX3.1 expression; 
Biogroup 1 = AP1 binding site gene set according {o\ The bottom panel illustrates the overlap between Bioset 1 and Biogroup 1 in a Venn 
diagram (left) and in bar graphs (right). The bar graph shows that most genes containing conserved AP1 binding sites are activated by 
NKX3.1 expression. The individual genes are indicated in Supplementary Table 1 and Supplementary Table 2. (B) Same as above for serum 
response factor (SRF). 
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Figure S6. Overlap in mRNA expression between the 5x dataset and in human prostate cancer derived cell lines. (A) The top panels 
summarize the datasets: Bioset 1 = 5x dataset of mRNAs affected by NKX3.1 expression; Bioset 2 = Prostate cancer derived cell lines versus 
normal prostate epithelial cells^. The bottom panel illustrates the overlap between Bioset 1 and Bioset 2 in a Venn diagram (left) and in bar 
graphs (right). The bar graph highlights the largely opposite gene expression patterns in the two biosets. The individual genes are indicated 
in Supplementary Table 1 and Supplementary Table 2. 
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Figure S9. Uncropped immunoblots for Figure S1. 
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Supplementary Table 1 Genes upregulated in NKX3.1 overexpressing LH cells 







Fold Change Common 


Genbank 




95 81 


6045.64 








_UUZ 1 00. o 


heat shock 70kDa protein 6 (HSP70B') (HSPA6) 


110 46 




49 5 


KRTt7 


NM 


000422 1 


keratin 17 (KRT17) 


119 76 


5346 03 


44 6 


S100A2 


NM 


005978 3 




93 23 


4068.60 


43 6 




NM 


000424.2 


keratin 5 (KRT5) 


93.98 


3943.20 


42 0 


SFN^ 


NM 


006142.3 


stratifin (SFN) 






41 1 


VAMP8 


nm" 


003761 2 




97 57 


3661 .97 


37 5 




NM 


005130.3 


fibrob^st^grow^^^^ (VAMP8) 


107 32 




34 6 


CCND2 


NM 


001 759 2 


cyclin D2 (CCND2) 


121 60 


4128 94 


34 0 


LAMAS 


NM 


198129 1 




88 46 


2837.16 


32 1 




nm" 


002276 3 


keratin 19 (KRT1 9) 


144 93 




294 


C0L1 7A1 


NM 


130778 1 




145 78 


3897 20 


26 7 


SPINT2 


NM 


021 1 02 2 


serine^peptidase^inNl^^^^^ Kun'itz ty^pe V(SmNT2')^^'^'^"* ^'^"'^ 


115 19 


2687 88 


23 3 




NM 


001630 1 


annexin A8 (ANXA8) 


105 01 


2180 30 


20 8 


SCNN1 A 


NM 


001 038 4 


^uanine'^nucleotidTbi!^^^^ (V'rotein)^al'^ha ^1 5 (G class) (GNA1 5) 


101 89 


2069.73 


20 3 




NM 


002068 1 




89 10 




19 4 


C19orf33 


nm" 


033520.1 


chrorTO^someT9'open^^^ qcass)( 


107 32 


2049.15 


19 1 




NM 


005555 2 


keratin 6B (KRT6B) 


103 83 




19 0 


S100A14 


NM 


020672.1 


S100 calcium binding protein A14 (S100A14) 


101.98 


1897.81 


18 6 




nm" 


002775 3 


HtrA serine peptidase 1 (HTRA1) 






18 2 


ALDH1 A3 


NM 


000693 1 




109 01 


1977 00 


18 1 




NM 


006942 1 


SRY (sex determining region Y)-box 15 (S0X15) 


100 36 


1788 35 


17 8 


CLDN7 


NM 


001307.3 


claudin 7 (CLDN7) 


100 53 


1756 72 


17 5 


FEZ1 


NM 


005103 3 




93 82 


1557 91 


16 6 


PRSS8 


NM 


002773 2 


p^rotease^'serkie^ 8 "prostasin^c'pRS^ ^ ^'^^^^ ^' ^'^^'^^^"^^ vanant 1 


117 23 


1908.68 


16 3 




NM 


005618.2 


delta-like 1 (Drosophila) (DLL1) 


87 68 


1366.63 


15 6 


CDHS 


nm" 


001 793 3 


cadherin 3, type 1 , P-cadherin (placental) (CDH3) 


108 56 


1690.53 


15 6 




NM 


001085 4 




120 16 


1838.50 


15 3 




NM 


005558 3 


ladinin 1 (LAD1 ) 


154 01 


2268 36 


14 7 


LOC649970 


xm" 


939056.1 




179.94 


2561.59 


14 2 


DNAJA4 


NM~ 


018602.2 


DnaJ (Hsp40) homolog, subfamily A, member 4 (DNAJA4) 


111.14 


1523.41 


13 7 


SH3PXD2A 


NM 


014631 2 


SH3 and PX domains 2A (SH3PXD2A) 






13 6 




NM 


021991 1 




161 68 


2091 77 


12 9 


CYGB 


nm" 


134268.3 


cytoglobin (CYGB) 


97 49 


1212.63 


12 4 


SLPI 


NM 


003064 2 




106 40 


1300.93 


12 2 


TNF 


NM 


_uuuoy4.z 


t^umoTniecrosirfector^ 2) (TNF) 


138 73 


1643 40 


118 


CRYAB 


nm" 


001885 1 


crystallin alpha B (CRYAB) 


111 78 




118 




NM 


022121 2 


pgpp JP53 apoptosis effector (PERP) 


98 74 


1 1 38 99 


115 


GJB2 


NM 


004004 3 




93 66 




112 


RAB25 


nm" 


020387.1 


RAB25 member RAS oncogene family (RAB25) 


132 86 


1485 78 


112 


ALDOC 


NM 


005165 2 


aldolase C, fructose-bisphosphate (ALDOC) 


114 30 


1254 82 


110 


CKMT1 A 


NM 


00101500' 


chromrsomeTo o'^enreadi^T' flarie^'l'e (ClOorfl' 16^^^ encoding mitochondnal protein 


102 38 


1073 89 


10 5 


ClOorfl 16 


NM 


006829 2 




101 20 


1032.76 


10 2 




NM 


199161 1 


serumTmyloid Al''(SAA^^ transcr^^ variant 2 ° 


1144 71 




10 1 


HSPA1A 


nm" 


005345.4 


heat shock 70kDa protein 1A (HSPA1A) 


105 70 


1004.56 


9 5 


FLJ34922 


NM 


152270 2 


likely ortholog of mouse schlafen 8/9 (FLJ34922) 


144 26 




9 5 


SERINC2 


NM 


178865.3 


serine incorporator 2 (SERINC2) 


152.90 


1405.69 


9 2 




NM 


00100561! 


microsorraMutat^^ transcri t variant lb 


^78 62 


1 180 33 


9 1 


MGST1 


NM 


020300 3 








9 1 


CA2 


NM 


000067 1 


rarbon?(ranh drase"" (CA2T"^ ^'^^^^ ranscnp vanan 


128 99 


1167 18 


9 0 


LAMC2 


NM. 


_018891.1 


iTrninin"^ amrna'^2TAMC2) transcri t variant 2 


103 78 


923 38 


8 9 


TACSTD1 


NM_ 


_002354.1 


tumor assodated calcium si nal traTsd^uceTl (TACSTD1) 


222.88 


1936.09 


8 7 


CAI\/IK2N1 


NM_ 


_01 8584.4 


caldu"r^Ta°mod^ulin^de^ rotein kinase II inhibitor 1 (CAMK2N1) 




3779.90 


8 6 




NM. 


_005560.3 


iTm^nin a^ ha 5^(LAmT5)" '"^^^ in i i or ( 


^8945 




8 6 


CRABP2 


NM_ 


_00 1878.2 


cellular retinoic acid bindin rotein 2 (CRABP2) 


124 94 


1032 32 


8 3 


ClOorflO 


NM_ 


_007021.2 


chror^L^orre°'l0^open?eadkig^ 10 (ClOorflO) 


105 80 




8 2 




NM_ 


_001 323.2 


cystatin E/M (CST6) 


122 14 


998 28 


8 2 


IVIGC59937 


NM. 


_1 9900 1.1 


Similar to RIKEN cDNA 2310002J15 gene (MGC59937) 


210 97 






NDRG1 


NM_ 


_006096.2 




132 68 


1045 61 


79 




NM_ 


_000331.2 


serum amylotd'^A^'^SMl'^^^ ^ 


94 15 


732 60 


7 8 


nvn 1 9 


NM. 


_021220.2 


ovo-like 2 (Drosophila) (0V0L2) 


189 75 


1449 30 


7 6 


C3 


NM_ 


_000064.1 


har'^anTenhancTr 0^^^^ "lit^l''^(Droso hila) (HES1) 


141 54 




7 6 


HES1 


NM_ 


_005524.2 




524 45 


3993.87 


7 6 


CDKN1 A 


NM. 


_078467.1 


cycXi^dependent ki^^^ Cipl ) (CDKN1 A) transcript variant 2 


402 25 


3054.38 


7 6 


DNAJB1 


NM_ 


_006145.1 


DnaJ (Hsp40) homolog subfamily B member 1 (DNAJB1) 


99 55 




7 3 


LOC652846 


XM_ 


_942545.1 




120 22 


855 41 


7 1 




AF432419 


'^^Ip'^^^^' co'^p^et'e cdT^'^" ^ (Annexin VIII) (Vascular anticoagulant-beta) mRNA. 


106 48 


750 63 


7 0 


LOC652878 


XM_ 


_942594.1 


PREDICTED: similar to heat shock 70kDa protein 6 (HSP70B) (LOC652878) 


98 31 


689 97 


7 0 




NM_ 


_002226.3 




108.89 


757.99 


7 0 


KLK5 


NM_ 


_01 2427.3 


kallikrein 5 (KLK5) 






6 9 


RASD1 


NM. 


_01 6084.3 


RAS dexamethasone-induced 1 (RASD1) 


44941 


3093 93 


6 9 


TFPI2 


NM_ 


_006528.2 




104 51 




6 7 


MGC61598 


XM_ 


_939432.1 


PREDICTED" similar to ankyrin-repeat protein Nrarp (MGC61598) 


107.73 


707.28 


6 6 


SERPINB5 


NM. 


_002639.2 








6 4 


IL1A 


NM_ 


_000575.3 


fnte'rlL'k^ri'^ a^pha'^(IL1A^^^ '^'^"^^ ^ (ovalbumin), member 5 (SERPINB5) 


104 82 


672 27 


6 4 


LCN2 


NM_ 


_005564.2 


lipocalin 2 (oncogene 24p3) (LCN2) 


95 65 


605 92 


6 3 




DN919451 


MCF7RNAL26H10TF; cDNA clone MCF7 RNA L 26 H 10, mRNA sequence 


166 88 


1039.66 


6 2 


STK32B 


NM_ 


_018401.1 


serine/threonine kinase 32B (STK32B) 


90.80 


554.44 


6 1 


GJB3 


NM_ 


_00 100575: 








6 1 


PYCARD 


NM_ 


_145183.1 


PYD and CARD domain containing (PYCARD), transcript variant 3 


1796 87 


10861 .73 


6 0 


HSPA1B 


NM_ 


_005346.3 


heat shock 70kDa protein IB (HSPA1B) 


102.67 


619.47 


6 0 


IVIAL2 


NM. 


_052886.1 








6 0 


CD14 


NM_ 


_000591.1 


CD14 antigen (CD14) 


156 19 


924 35 




C1orf130 


NM_ 


_001 010981 




252 76 


1474 02 


58 


GPR56 


NM. 


_20 1525.1 


G^protefri' coupled^rec^ Transcr^pt^ variant 3 


165 69 


95047 


5 7 


GALNAC4S-65 NM_ 


_0 15892.2 


B cell RAG associated protein (GALNAC4S-6ST) 


1465 73 


8400.42 


5 7 


MT1X 


NM_ 


_005952.2 


metallothionein IX(MTIX) 


174 41 




5 7 


RXRA 


NM. 


_002957.3 


retinoid X receptor alpha (RXRA) 


192 07 


1095 07 


5 7 


ARHGEF16 


NM. 


_0 14448. 2 


Rho guanine exchange factor (GEF) 16 (ARHGEF16) 


84.77 


482.18 


5 7 


SNX10 


NM_ 


_01 3322.2 


sorting nexin 10 (SNX10) 




908 48 


5 6 


TPD52L1 


NM_ 


_00100339( 




100 06 




5 5 


CKMT1B 


NM. 


_020990.3 


c^reTtin'e kinase^mito^^ encoding mitochondrial protein 


78 90 


435 00 


5.5 


TMEM30B 


NM_ 


_001 017971 


transmembrane protein 30B (TMEM30B) 


75.90 


413.88 


5.5 


C0L17A1 


NM_ 


_000494.2 


collagen, type XVII, alpha 1 (C0L17A1), transcript variant long 


116.28 


629.60 


5.4 


CYP27B1 


NM. 


_000785.3 


cytochrome P450, family 27, (CYP27B1 ), nuclear gene encoding mitochondrial protein 


89.61 


481.15 


5.4 


PADI3 


NM_ 


_016233.1 


peptidyl arginine deiminase, type III (PADI3) 


103.59 


551.70 


5.3 


TNFRSF6B 


NM_ 


_003823.2 


tumor necrosis factor receptor superfamily, member 6b, decoy (TNFRSF6B) 


141.20 


746.41 


5.3 


CASP4 


NM_ 


_033306.2 


caspase 4, apoptosis-related cysteine peptidase (CASP4), transcript variant gamma 


105.73 


558.55 


5.3 


ADAIVI8 


NM. 


_001 109.2 


ADAM metallopeptidase domain 8 (ADAM8) 


620.09 


3247.84 


5.2 


KRT7 


NM_ 


_005556.3 


keratin 7 (KRT7) 


71.65 


373.96 


5.2 


SGPP2 


XM_ 


_938742.1 


PREDICTED: sphingosine-1 -phosphate phosphotase 2 (SGPP2) 


145.03 


750.85 


5.2 


VTI1A 


NM. 


_1 45206.1 


vesicle transport through interaction with t-SNAREs homolog 1 A (yeast) (VTI1A) 


248.33 


1280.59 


5.2 


DMKN 


NM_ 


_033317.2 


dermokine (ZD52F10), transcript variant 2 


120.28 


616.42 


5.1 


PDGFB 


NM_ 


_033016.1 


platelet-derived growth factor beta polypeptide (PDGFB), transcript variant 2 


92.53 


470.15 


5.1 


FAM83A 


NM. 


_032899.4 


family with sequence similarity 83, member A (FAM83A), transcript variant 1 


129.03 


654.21 


5.1 


PANX2 


NM_ 


_052839.2 


pannexin 2 (PANX2) 


265.21 


1335.89 


5.0 


GST02 


NM_ 


_1 83239.1 


glutathione S-transferase omega 2 (GST02) 


74.51 


367.32 


4.9 


C20orf42 


NM 


017671.3 


chromosome 20 open reading frame 42 (C20orf42) 



PCa i PCaDCLjAPI SRF 






Red numbers indicate fold upregulation, green fields indicate downregulation, black fields indicate the presence of binding sites for the indicated transcription factors, Pea = prostate 
cancer; PCaDLC = Prostate cancer derived cell line; AP1 = Activator protein 1; SRF = Serum response factor 
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Supplementary Table 2 Genes downregulated in NKX3.1 overexpressing LH cells 



GFP NKX3.1 Fold Change Common Genbank 



Description 



PCa t PCaDCLtAPI SRF 




20346.20 346.53 -58.7 TAGLN NM_003186.3 transgelin (TAGLN), transcript variant 2 

8558.84 332.86 -25.7 EEF1A2 NM_001 958.2 eukaryotic translation elongation factor 1 alplia 2 (EEF1A2) 

2350.29 102.48 -22.9 GREM1 NM_013372.5 gremlin 1 , (GREM1 ) BMP antagonist 

2729.31 133.15 -20.5 CDH2 NM_001792.2 cadherin 2, type 1 , N-cadherin (neuronal) (CDH2) 
4163.75 203.80 -204 F0XD1 NM_004472.1 forkhead box D1 (F0XD1 ) 

4763.13 250.86 -19.0 FLNC NM_001458.2 filamin C, gamma (actin binding protein 280) (FLNC) 

1685.00 91.27 -18.5 KISS1 NM_002256.2 KiSS-1 metastasis-suppressor (KISS1 ) 

2828.77 170.25 -16.6 H2BFS NM_017445.1 H2B histone family, member S (H2BFS) 

1669.36 101.83 -16.4 C0L5A1 NM_000093.2 collagen, type V, alpha 1 (C0L5A1) 

2023.86 124.73 -16.2 C0L1A1 NM_000088.2 collagen, type I, alpha 1 (C0L1A1) 

1511.79 95.23 -15.9 IGFBP5 NM_000599.2 insulin-like growth factor binding protein 5 (IGFBP5) 

1245.24 98.50 -12.6 DKK1 NM_01 2242.2 dickkopf homolog 1 (Xenopus laevis) (DKK1) 

9981.32 926.93 -10.8 HIST1H2BK NM_080593.1 histone 1 , H2bk (HIST1 H2BK) 

1898.60 183.20 -10.4 MYL9 NM_006097.3 myosin, light polypeptide 9, regulatory (MYL9), transcript variant 1 
866.92 87.80 -9.9 ANKRD1 NM_014391.2 ankyrin repeat domain 1 (cardiac muscle) (ANKRD1) 

3012.22 312.13 -9.7 UCHL1 NM_004181 .3 ubiquitin carboxyl-terminal esterase LI (ubiquitin thiolesterase) (UCHL1 ) 

1191.03 125.97 -9.5 FLJ90166 NM_153360.1 hypothetical protein FLJ901 66 (FLJ901 66) 
2731.68 320.47 -8.5 HIST1H1C NM_005319.3 histone 1, H1c (HIST1H1C) 
1386.94 167.25 -8.3 C21orf7 NM_020152.2 chromosome 21 open reading frame 7 (C21orf7) 

734.37 91.82 -8.0 HAK NM_052947.2 heart alpha-kinase (HAK) 

1119.88 143.28 -7.8 SAMD9L NM_152703.2 sterile alpha motif domain containing 9-like (SAMD9L) 

2827.18 362.56 -7.8 IFIT3 NM_001549.2 interferon-induced protein with tetratricopeptide repeats 3 (IFIT3) 

4915.30 638.92 -7.7 TPM1 NM_000366.5 tropomyosin 1 (alpha) (TPM1), transcript variant 5 
1592.98 210.41 -7.6 CRIP1 NM_001311.3 cysteine-rich protein 1 (intestinal) (CRIP1) 
1899.13 252.10 -7.5 ZNFX1 NM_021 035.1 zinc finger, NFX1 -type containing 1 (ZNFX1) 

860.45 117.66 -7.3 HIST1H2AC NM_003512.3 histone 1 , H2ac (HIST 1 H2AC) 

1254.04 173.13 -7.2 COL5A2 NM_000393.2 collagen, type V, alpha 2 (COL5A2) 
745.48 103.08 -7.2 CSPG2 NM_004385.2 chondroitin sulfate proteoglycan 2 (versican) (CSPG2) 
926.98 131.84 -7.0 HIST1H2BD NM_138720.1 histone 1 , H2bd (HIST 1 H2BD), transcript variant 2 
659.60 94.04 -7.0 BGN NM_001711.3 biglycan (BGN) 

4342.70 631.33 -6.9 CTGF NM_001901.1 connective tissue growth factor (CTGF) 

3043.84 442.85 -6.9 TNC NM_002160.1 tenascin C (hexabrachion) (TNC) 

5592.26 814.41 -6.9 IFI35 NM_005533.2 interferon-induced protein 35 (IFI35) 

1098.59 164.59 -6.7 HRASLS3 NM_007069.1 HRAS-like suppressor 3 (HRASLS3) 

8826.28 1335.18 -6.6 TAP1 NM_000593.5 transporter 1, ATP-binding cassette, sub-family B (MDR/TAP) (TAP1) 

1 024.41 1 61 .62 -6.3 TNFSF1 0 NM_00381 0.2 tumor necrosis factor (ligand) superfamily, member 1 0 (TNFSF1 0) 

1271.92 203.46 -6.3 DAB2 NM_001343.1 disabled homolog 2, mitogen-responsive phosphoprotein (Drosophila) (DAB2) 

571.48 92.78 -6.2 SIM2 NM_005069.2 single-minded homolog 2 (Drosophila) (SIM2), transcript variant SIM2 

1556.36 256.03 -6.1 TRIM22 NM_006074.2 tripartite motif-containing 22 (TRIM22) 

5505.46 906.28 -6.1 RAGE NM_014226.1 renal tumor antigen (RAGE) 

4791.61 788.84 -6.1 STAT2 NM_005419.2 signal transducer and activator of transcription 2, 1 13kDa (STAT2) 
676.57 112.59 -6.0 NPTX1 NM_002522.2 neuronal pentraxin I (NPTX1 ) 

4485.08 750.68 -6.0 KLF6 NM_0013004 Kruppel-like factor 6 (KLF6), transcript variant 2 

4408.67 764.67 -5.8 THBS1 NM_003246.2 thrombospondin 1 (THBS1) 

2424.90 422.90 -5.7 IFI44L NM_006820.1 interferon-induced protein 44-like (IFI44L) 

888.14 155.10 -5.7 OXTR NM_000916.3 oxytocin receptor (OXTR) 

596.39 104.82 -5.7 ACTG2 NM_001615.3 actin, gamma 2, smooth muscle, enteric (ACTG2) 

1850.60 338.21 -5.5 THY1 NM_006288.2 Thy-1 cell surface antigen (THY1) 

615.68 116.02 -5.3 MYLK NM_005965.3 myosin, light polypeptide kinase (MYLK), transcript variant 6 

705.08 138.13 -5.1 C2orf32 NM_015463.1 chromosome 2 open reading frame 32 (C2orf32) 

2339.28 461.31 5 PLEKHC1 NM_006832.1 pleckstrin homology domain containing, family C member 1 (PLEKHC1) 

Green numbers indicate fold downregulation, red fields indicate upregulation, blacl< fields indicate the presence of binding sites for the indicated transcription factors; Pea = 

prostate cancer; PCaDLC = Prostate cancer derived cell line; AP1 = Activator protein 1; SRF = Serum response factor 





Supplementary Table 3 Overlap between NKX3.1 target genes in mouse and in human cells 









Regulation by NKX3.1 




Regulation by 


Human ID 


Human Symbol 


Entrez Gene Description 


in human cells 


Mouse Symbol 


NKX3.1 in mice 


NM_000067.1 


CA2 


carbonic anhydrase II 


Activated , 


Car2 


Activated ■ 


NM_01 9058.2 


DDIT4 


DNA-damage-inducible transcript 4 


■ctivated m 


Ddit4 


Activated 1 


NM_002155.3 


HSPA6 


heat shock 70kDa protein 6 (HSP70B') 


■ctivated 


Hspa5 


Activated ■ 


NM_005950.1 


MT1G 


metallothionein 10 


Bctivated 


Mtl 


Activated I 


NM_005952.2 


MT1X 


metallothionein IX 


■ctivated Mj^^^^ 


Mtl 


^tivated 


NM_033381.1 


COL4A5 


collagen, type IV, alpha 5 


Activated 


Col4a5 


Repressed J 


NM_014988.1 


LIMCH1 


LIM and calponin homology domains 1 


Repressed 


Limchi 


Activated 


NM_1 53360.1 


APCDD1L 


adenomatosis polyposis coli down-regulated 1-like 


Repressed 


Apcddl 


Repressed ■ 


NM_1 52330.2 


FRMD6 


PERM domain containing 6 


■Repressed 


Frmd6 


Repressed ■ 


NM_006472.1 


TXNIP 


thioredoxin interacting protein 


Repressed 


Txnip 


Repressed ^ 



Ref.: Anderson et al. Journal of Clinical Investigation 2012, 122:1907-1919. 
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Supplementary Table 4 Predicted and actual response to TGFa and NKX3.1 



Name 

LAMAS 

ALDH1A3 

LAD1 

SAA1 

HSPA1A 

LAMC2 

C10orf10 

JAG2 

TFPI2 

LCN2 

CD14 

CASP4 

ADAMS 

PDGFB 

RASD1 

MGST1 

IFIT3 

MYLK 



TGFa response predicted by 
expression edge 



Actual response to NKX3. 1 




Supplementary Table 5 Control of TGFp -related gene expression by NKX3.1 



GPP 



NKX3.1 



Fold Change Common Name Desciption 



901.3701 201.12804 -4.481573529 TGFB1I1 transforming growth factor beta 1 induced transcript 1 (TGFB1I1) 

339.3748 85.09652 -3.9881 16083 TGFB2 transforming growth factor, beta 2 (TGFB2) 

531 .43066 237.8984 -2.233855545 TGFBR2 transforming growth factor, beta receptor II (70/80kDa) (TGFBR2), transcript variant 1 

337.46625 158.84158 -2.124546041 SMAD7 SMAD, mothers against DPP homolog 7 (Drosophila) (SMAD7) 

3151.0513 1573.5414 -2.002522018 TGFBI transforming growth factor, beta-induced, 68kDa (TGFBI) 

209.2047 126.932335 -1.648159234 TGFBRAP1 transforming growth factor, beta receptor associated protein 1 (TGFBRAP1) 

638.00244 389.01 108 -1 .640062386 TGIF2 TGFB-induced factor 2 (TALE family homeobox) (TGIF2) 

125.47178 85.58007 -1.466133178 SI\/IAD6 SMAD, mothers against DPP homolog 6 (Drosophila) (SI\/IAD6) 

677.43225 504.6805 -1 .342299237 SI\/IAD4 SMAD, mothers against DPP homolog 4 (Drosophila) (SMAD4) 

1 18.65842 89.9339 -1 .319395912 TGIF TGFB-induced factor (TALE family homeobox) (TGIF), transcript variant 2 

386.59677 302.59946 -1.277585789 SNIP1 Smad nuclear interacting protein 1 (SNIP1) 

133.12515 107.34483 -1.240163592 SMURF2 SMAD specific E3 ubiquitin protein ligase 2 (SMURF2) 

396.22388 322.08224 -1.230194748 SMAD3 SMAD, mothers against DPP homolog 3 (Drosophila) (SMAD3) 

1 10.44686 90.07587 -1 .226153686 TGIF TGFB-induced factor (TALE family homeobox) (TGIF), transcript variant 4 

1 19.22626 97.336044 -1 .224893216 TGFBR3 transforming growth factor, beta receptor III (betaglycan, 300kDa) (TGFBR3) 

321 .90323 267.21222 -1 .204672563 TGIF TGFB-induced factor (TALE family homeobox) (TGIF), transcript variant 1 

209.43947 186.34435 -1.12393786 SMURF1 SMAD specific E3 ubiquitin protein ligase 1 (SMURF1), transcript variant 2 

114.60477 102.95182 -1.113188383 LOC653804 similar to SMAD, mothers against DPP homolog 2 (Drosophila) (LOC653804) 

141.06203 133.9327 -1.05323069 SMAD2 SMAD, mothers against DPP homolog 2 (Drosophila) (SMAD2), transcript variant 2 

135.20764 131.16252 -1.030840518 TGFBR1 transforming growth factor, beta receptor I (TGFBR1) 

114.35846 111.279144 -1.027671996 TGFBI transforming growth factor, beta 1 (TGFBI) 

106.51333 105.51177 -1.009492401 SMADI SMAD, mothers against DPP homolog 1 (Drosophila) (SMAD1), transcript variant 2 

106.71678 1 18.3267 1 .108791888 TAIP-2 TGF-beta induced apotosis protein 2 (TAIP-2) 

425.55972 510.4316 1.199435886 SI\/IAD5 SMAD, mothers against DPP homolog 5 (Drosophila) (SI\/IAD5), transcript variant 1 

1211.3107 1468.2693 1.212132692 TINP1 TGF beta-inducible nuclear protein 1 (TINP1) 

I mRNAs downregulated >1. 5-fold 



Supplementary Table 6 Raw hybridization values for TNFSF10/TRAIL of duplicate samples 



GFP 


NKX3.1 


7 h 


lOh 


7h 


lOh 


TNFSFIO/T 1058.53 


990.92 


1076.59 


1139.69 


169.59 


153.63 


168.56 


132.50 
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